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[Title of Invention]: -y. ™ - — \ ' 

Variable Rate Voice-Coding Apparatus 

r 

[Abstractly - .; .1 J^"?^'] 

[PURPOSE] This iftVehtion is concerned with a variable rate voice-coding 

apparatus for coding voice signals in which the coding rate is varied depending on the characteristics 
of the voice signals. And the invention, particularly, aims at enabling to realize an economical 
configurations for such voice-coding apparatuses. by offering a simple algorithm. 
[CONSTITUTION] Provided is a detector 1 counting the number of occurrences of 

zero-crossing associated with a voice signal and compares it with a threshold. When it is larger, the 
concerned period is judged to be a consonant sound and otherwise the period is judged to be a vowel 
sound, a delaying circuit 2 causing a time delay corresponding to the time required for the detector 1 
to perform the processes, and a plurality of coding units 3, 4, receiving the voice signal through the 
delaying circuit, coding units 3,4 perform coding processes of which coding rates are mutually 
different. 

[Claims] 

[Claim 1] A variable rate voice-coding apparatus which is so configured that the vowel and 

consonant periods of a voice signal are distinguished from each other and the voice signal within the 
consonant period is coded at a lower rate than the rate at which the voice signal within the vowel 
period is coded. 

[Claim 2] The variable rate voice-coding apparatus according to claim 1, comprising: 

a detector (1) for detecting the vowel and consonant periods of the voice signal based on 

the number of occurrences of zero-crossing associated with the voice signal; 

a delaying circuit (2) associated with a delaying period corresponding to the period the 

detector (1) takes for completing the detecting process; and 

a plurality of coding units (3, 4) associated with mutually different coding rates, of which 

one is selected by a detector signal issued by the detector (1), and to which the voice signal is 

inputted via the delaying unit (2). 
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[Detailed Description of the Invention] 

(0001] 

[Field of the Invention] The invention is concerned with a variable rate voice-coding 
apparatus in which a voice signal is coded at a rate depending on the characteristics of the voice 
signal. A number of methods for band-compression-coding of a voice signal are known for the 
objective of efficiently using transmission media. A typical example of such apparatuses is the one, 
which is configured to include steps of sampling an analog voice signal at 8 kHz, converting the 
sampled data to a 12 - 16 bit digital signal, band-compressing it to a 8 bit signal according to a 
compression-decompression rule such as the A-rule or the //-rule, and transmitting it at a speed of 64 
kb/s. A separately developed communication technology, such as me ATM technology, enables 
dynamically changing the communication speed between a pair of communicating terminals, so that 
it becomes possible, in association with a digital voice signal, to transmit only the voice signal 
within periods in which any sound is really present, in a cell (packet) form. In these methods, it is 
more efficient if the rate of voice signal coding is changed dynamically based on the characteristics 
of the voice signal than coding a voice signal at a constant coding rate, while this needs to be 
achieved by an economical configuration. 
(0002] 

[Description of the Prior Art] Variable rate voice-coding apparatuses according to prior 

arts may be represented by the apparatus example shown in Fig. 5, which comprises a 4-bit- ADPCM 
coding unit 51, a 3-bit-ADPCM coding unit 52, and a quality assessing unit 53. In regard to other 
numerical indicators shown in the drawing, 54, 55 are delaying circuits, 56 is a selector, 57 is a 
packet composing unit, 61 is a packet decomposing unit, and 62, 63 are ADPCM decoding units. 
[0003] A PCM signal constituting an input voice signal is introduced to the ADPCM coding units 
51, 52 as well as to the quality assessing unit 53. The ADPCM units 51, 52 convert the PCM signal 
to 4-bit and 3-bit ADPCM (adaptive differential pulse code modulation) signals respectively and 
send them to the selector 56 via the delaying circuits 54, 55. The quality assessing unit 53 controls 
the operation of the selector 56 so that one of the 4-bit and 3-bit ADPCM signals is selected to arrive 
at the packet composing unit 57. The packet composing unit 57 composes a packet of an ADPCM 
signal having a header containing a selection information, destination address and so on and sends 
the packet to the receiver side via a network such as a packet exchange network. In the case of this 
particular example, if the inputted voice in the PCM signal form is at a speed of 64 kb/s (i.e., 8 kHz 
for the sampling rate with an 8-bit construction), it becomes a signal of 32 kb/s bit rate or a signal of 
24 kb/s bit after being coded to a 4-bit ADPCM signal or a 3 -bit ADPCM signal, respectively. 
[0004] The quality assessing unit 53 receives both the 4-bit and 3-bit ADPCM signals from the 
coding units 51, 52 and decodes them in the unit of a frame and once in a certain number of frames 
(for instance, once in 90 samples: 11.25 ms), and derives the quantization-associated signal-to-noise 
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ratio (SNR) based on the inputted voice PCM signal. The quality assessing unit 53 controls operation 
of the selector 56, in a manner so that the selector 56 selects a lower bit rate if it is found to satisfy a 
pre-assigned threshold for the coding quality assessment. The delaying circuit 54, 55 are there for 
delaying the timing of inputting ADPCM signals received from the ADPCM coding units 51, 52 to 
the selector 56 by a period corresponding to the time taken for the quality assessing unit 53 
performing the relevant processes. 

[0005] At the receiving side of the communication, the packet decomposing unit 61 decomposes a 
received packet to retrieve a corresponding ADPCM signal and identify which of the 4-bit or 3 -bit 
ADPCM signal it is from the additional information, and has the ADPCM signal decoded back to a 
voice PCM signal at one of the 4-bit and 3-bit ADPCM decoding units 62, 63 which the packet 
decomposing unit selectively activates. In an apparatus in which the ATM technology is employed, a 
signal is transmitted in a cell form, and for this end, said packet composing unit 57 and decomposing 
units 61 are replaced by cell composing and cell decomposing units respectively and a cell is 
composed of, for instance, a 5-byte cell header carrying information such as a logic pass and a logic 
channel, and a 48 byte information field for carrying information such as an ADPCM signal. 
[0006] 

[Problems to be solved by the Invention] A variable rate voice-coding apparatus 

according to a prior art, such as the above described example, is capable of not only achieving a 
certain voice quality in a reliable manner but also achieving some sort of reduction in the average bit 
rate. It is, however, necessary to have at least two of the ADPCM coding units 51, 52 operating 
simultaneously and to accept the quality assessing unit having a complex configuration operating in 
a complex manner. In particular, the apparatus needs to be large in terms of the hardware size so as 
the installation costs become high, and in the case of it being associated with an operation under 
control of a processor such as a digital signal processor (DSP), it is necessary to have a program with 
a large number of steps which slow down the processing. 

The invention makes it the objective to offer a variable rate voice-coding apparatus which is 
economical to configure and operate because of the simplicity of the algorithm adopted for the 
apparatus. 
[0007] 

[Means for solving the problem] A variable rate voice-coding apparatus of the present 
invention may be explained in association with the Fig.l. The apparatus is associated with a 
configuration in which a voice signal is classified into a vowel period and a consonant period and a 
lower coding rate is employed for coding the voice signal within the consonant period than the 
coding rate employed for the voice signal within the vowel period. 

[0008] The solution to the problem may also be attained by a variable rate voice-coding apparatus 
configuration which comprise: 
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a detector (1) for identifying the vowel and consonant periods of the voice signal based on 
the number of occurrences of zero-crossing with the voice signal; 

a delay circuit (2) associated with a delaying period corresponding to the period the 
detector (1) takes for completing the identifying processes; and 

a plurality of coding units (3, 4) associated with mutually different coding rates, of which 
one is selected by a detector signal issued by the detector (1), and to which the voice signal is 
inputted via the delaying unit (2). 
[0009] 

[Operation] A consonant sound of a voice signal, in its original form, is associated with a 

turbulent flow of air generated by an outward breath passing through a narrowed vocal cords and, for 
this reason, is constituted mainly from noise like sound elements. It is known that the quality and 
clarity of a reproduced sound, as a whole, is not affected very much by the a reduced quality of 
reproduction of the consonant portion. This means that it is allowed to have a larger error being 
entrained in quantumizing a voice signal of a consonant period than of a vowel period. This fact 
indicates that applying a lower coding rate to a consonant period voice signal than the coding rate 
applied to a vowel period voice signal may be possible for the purpose of achieving a required 
degree of the overall bit-rate reduction without impairing the over all voice quality. 
[0010] 

It is also known to us that the number of occurrences of zero-crossing is larger for a consonant 
period than for a vowel period of a voice signal and this distinction is usable in classifying a voice 
signal between the consonant and vowel periods, by counting the number of polarity inversions 
occurring with and inputted PCM signal to the detector, and comparing this with pre-assigned 
threshold. The result of the identification process performed by the detector (1) is linked to the 
selection and activation of only one of the coding units 3, 4. 
[0011] 

[Embodiments of the Invention] Fig.2 is an explanatory drawing associated with an 
embodiment of the present invention, in which 11 indicate a detector unit identifying the distinction 
between vowel and consonant portions of a voice signal based on the number of occurrences of the 
zero-crossing, 12 indicates a delaying circuit, 13 indicates a 4-bit ADPCM coding unit, 14 indicates 
a 3-bit ADPCM coding unit, 15 indicates a packet composing unit, 21 indicates a packet 
decomposing unit, 22 indicates a 4-bit ADPCM decoding unit and 23 indicates a 3-bit ADPCM 
decoding unit. 

[0012] The detector 11 counts the number of zero-crossing occurring in each period of a 
pre-assigned duration in association with a inputted PCM signal and compares it with the number to 
a pre-assigned threshold value. The number of occurrences of zero-crossing can be obtained, for 
instance, by counting the number of polarity inversions taking place with the PCM signal. 
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Consonant sounds are constituted by voice elements being dominated by noise-like voice elements 
as explained earlier, and therefore, associated with a larger number of zero-crossing ocurrences. It is, 
therefore, possible to judge a period as the period belonging to a consonant sound if the number of 
zero-crossing occurrences within the pre-assigned period of time is large. As a reduction in the 
reproducing quality of a consonant sound does not affect very much the clarity of a over all 
reproduced voice, the apparatus can be configured in such a manner that the voice signal within the 
consonant periods is coded by duly selected and activated 3-bit ADPCM coding unit 14, the lower 
bit-rate unit of the available two. The information on this result of the ADPCM selection is inputted 
to the packet composing unit 15 to have the packet composing unit 15 formulate the information into 
the header information of a packet. Conversely, if the number of zero-crossing occurrences is small 
and a period is judged to be a vowel period, the higher bit rate unit, 4-bit ADPCM coding unit 13 is 
selected and activated, and this information on the coding unit selection result is inputted to the 
packet composing unit to have the packet composing unit to formulate the information into the 
header information of a packet. 

[0013] The delaying circuit 12 is associated with the delaying by a time period corresponding to 
the time the detector 1 1 takes to complete its processes so that the voice PCM signal is inputted to 
one of the ADPCM coding units at a time matched to the time of process completion in the detector 
11, and the voice PCM signal belonging to the vowel periods is coded to the 4-bit ADPCM signal 
and the voice PCM signal corresponding to the consonant periods is coded to the 3 -bit ADPCM 
signal. Either the 4-bit or 3-bit PCM signal is inputted to the packet composing unit 15 and 
composed into a packet which has a header containing an address, auxiliary information, etc. and is 
transmitted to the receiver side via a network such as a packet exchange network. 
[0014] At the receiving side, the packet decomposing unit 21 decomposes received packets and 
has them decoded at one of the 4-bit and 3-bit ADPCM decoding units 22, 23 selected accordingly to 
the auxiliary information to obtain the voice PCM signal of the vowel periods from the 4-bit 
ADPCM signal and the voice PCM signal of the consonant periods from the 3-bit ADPCM signal. 
[0015] Fig.3 explains about the consonant and vowel portions of a voice signal, in which the 
drawing indicated by (a) is an example of analog voice signals and the drawing indicated by (b) is an 
waveform example of vowel sounds. As shown in (b), if the number of zero-crossing points (ZCs) in 
a pre-assigned period Tl is 8, for consonant, it is more than several tens. The distinction between the 
consonant and vowel period, thus, can be determined by counting the number of ZCs. Each ZC, as 
shown in the drawing, is a point at which the polarity of a voice PCM signal is inversed and 
therefore the number of ZCs is obtained by counting the polarity inversions associated with the voice 
PCM signal. The voice signal is coded into an ADPCM signal, as describe above, i.e., by the 4-bit 
ADPCM coding unit 13 when it is of a vowel period and by the 3-bit ADPCM coding unit 14 when 
it is of a consonant period. A voice signal, is coded at a higher-bit rate when it is in vowel periods 
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and at a lower-bit rate when it is in consonant periods without reducing the clarity of a sound for the 
purpose of improving the transmission efficiency. 

[0016] Fig.4 explains another embodiment of the present invention. 31 indicates a detector, 32 
indicates a delaying circuit, 33 indicates a 4-bit ADPCM coding unit, 34 indicates a 3-bit ADPCM 
coding unit, 35 indicates a packet composing unit, 36 indicates a 2-bit ADPCM coding unit, 41 
indicates a packet decomposing unit, 42 indicates a 4-bit ADPCM decoding unit, 43 indicates a 3-bit 
ADPCM decoding unit, and 44 indicates a 2-bit ADPCM decoding unit. 

[0017] This embodiment is concerned with a configuration in which a voice signal is classified 
into that of no-sound periods, of vowel periods, and of consonant periods and coded at mutually 
different coding rates. The detector 3 1 distinguishes no-sound periods of a voice signal in addition to 
distinguishing the previously described periods, i.e., consonant and vowel periods. The distinction 
between a no-sound period and a sound period (either of the consonant and vowel periods) becomes 
possible by determining a property, such as the level, of a voice PCM signal. 
[0018] One of the 4-bit, 3-bit, and 2-bit ADPCM coding units 33, 34, 36 is selectively activated 
based on the selection information resulting from the processes performed by the detector 31. For 
instance, when the selection information indicates a vowel period, 4-bit ADPCM coding unit 33 is 
selectively activated and the voice PCM signal inputted from the delaying circuit 32 is coded into a 
4-bit ADPCM signal. When the selection information indicates a consonant period, 3 -bit ADPCM 
coding unit 34 is selectively activated and the voice PCM signal inputted from the delaying circuit 
32 is coded into a 3 -bit ADPCM signal. 

[0017] And when the selection information indicates a no-sound period, 2-bit ADPCM coding 
unit 36 is selectively activated and the voice PCM signal (the background noise signal) inputted 
from the delaying circuit 32 is coded into a 2-bit ADPCM signal. A background noise signal 
corresponding to a no-sound period does not require to be of any specific signal clarity so that it is 
sufficient to code it at the lowest-bit rate. The packet composing unit 35 composes packets from the 
ADPCM signals and selection information received from the detector 31, and transmit them to the 
receiving side via a network such as a packet exchange network. 

[0020] At the receiving side, the packet decomposing unit 41 decomposes received packets and 
selects one of the 2-bit, 3-bit, and 4-bit ADPCM decoding units 42, 43, 44 to have it decode and 
obtain a decoded voice PCM signal, the parts corresponding to vowel periods from the ADPCM 
decoding unit 42, the parts corresponding to consonant periods from ADPCM decoding unit 43, and 
the background noise PCM signal parts, corresponding to no-sound periods from the ADPCM 
decoding unit 44. 

[0021] Above described embodiments are concerned with configurations in which a voice signal 
is coded into ADPCM signals. It is possible, however, to code a voice signal with various other 
coding methods. It is also possible to configure so that no signal is transmitted for periods 
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corresponding to the no-sound period of a voice signal in contrast to the embodiment shown in Fig.2 
in which a voice signal is coded into a 3-bit ADPCM signal while it is of consonant and no-sound 
periods, and to the embodiment shown in Fig.4, in which a voice signal, while it is of no-sound 
periods, is coded into a 2-bit ADPCM signal. 
[0022] 

[Effect of invention] As described above, the present invention is concerned with 

configurations in which the coding rate of a voice signal employed for a period is changed between 
vowel periods and consonant periods. And it is further concerned with configurations in which the 
distinction between vowel and consonant periods of a voice signal is determined in an easy way 
which relies on the number of zero crossing occurrences counted in associated with a voice signal. 
According to the configuration of the present invention, the associated power consumption is 
reduced from that associated with a prior art configuration because it is configured in a simpler 
manner and a plurality of the coding units 3, 4 is activated only selectively in contrast to a 
configuration associated with a prior art method of evaluating the property of a voice signal. 
Additionally, it becomes possible to configure a variable-rate coding apparatus having a higher speed 
processing capability as the algorithm on which the present invention rely is so simple that, even 
when employing a digital signal processor for further processing a voice signal, the associated 
program becomes simpler. 
[Brief explanation of drawings] 
[Fig. 1 ] Principles of the present invention 
[Fig.2] Embodiment of the present invention 
[Fig. 3] Explanation of consonant and vowel period of a voice signal 
[Fig.4] Another embodiment of the present invention 
[Fig.5] Embodiment according to a prior art technology 
[Explanation of codes] 

1 : Detector 2: Delaying circuit 3: Coding unit 4: Coding unit 
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